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Possible use of self-calibration to reduce systematic uncertainties in determining 
distance-redshift relation via gravitational radiation from merging binaries 
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By observing mergers of compact objects, future gravity wave experiments would measure the 
luminosity distance to a large number of sources to a high precision but not their redshifts. Given 
the directional sensitivity of an experiment, a fraction of such sources (gold plated - GP) can be 
identified optically as single objects in the direction of the source. We show that if an approximate 
distance-redshift relation is known then it is possible to statistically resolve those sources that have 
multiple galaxies in the beam. We study the feasibility of using gold plated sources to iteratively 
resolve the unresolved sources, obtain the self-calibrated best possible distance-redshift relation and 
provide an analytical expression for the accuracy achievable. We derive lower limit on the total 
number of sources that is needed to achieve this accuracy through self-calibration. We show that 
this limit depends exponentially on the beam width and give estimates for various experimental 
parameters representative of future gravitational wave experiments DECIGO and BBO. 



Establishing the nature of dark energy is a paramount 
objective of modern cosmology. A precise knowledge 
of cosmic distance to sources at moderate redshifts 
(z < few) is essential for success in this endeavor [l|. 
It has been suggested that gravitational radiation from 
merging binaries (neutron star (NS)-NS, NS-black hole 
(BH), and BH-BH) could be a 'standard siren' and a 
complementary means (to standard candles and rulers) 
for probing cosmic expansion 0-jy]- Indeed, a knowledge 
of the underlying physics of gravitational radiation from 
binaries could help establish the luminosity distance to a 
NS-NS binary to a precision of 2 per cent (ignoring, for 
the time being, the redshift-dependent error from grav- 
itational lensing). In order to serve as a cosmological 
probe however, the luminosity distance should be known 
as a function of redshift. Therefore, unlike other probes 
of distance, the main systematic uncertainty in this case 
is the identification (and redshift determination) of galax- 
ies hosting the binaries (see e.g. 043) • 

The space-borne gravitational wave observatory LISA 
[7[ is expected to achieve an angular resolution of about 
1' (for a detailed discussion see [J]). The volume bounded 
by this angle (inclusive of the redshift uncertainty from 
luminosity distance errors) is expected to contain roughly 
30 objects at z ~ 1. The directional sensitivity of next 
generation gravitational wave (GW) observatories such 
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as DECIGO Q, the Big Bang Observer (BBO) g and 
ASTROD [9] is likely to be even better (~ few arc sec- 
onds), in which case, in only a small fraction of cases 
would more than a single galaxy lie within the observa- 
tional beam pj|. 

The main source of uncertainty in using standard sirens 
to probe cosmic expansion comes from misidentifying 
galaxies hosting the standard sirens. Clearly, the larger 
the number of galaxies within the observational beam, 
the greater the chances for this to happen. Given the 
enormous potential of using gravity wave standard sirens 
to determine the nature of dark energy, it would clearly 
be desirable to minimize this source of systematics. One 
possible approach rests in determining an association be- 
tween merging binaries and the gas in the surrounding 
medium. Unique signatures of an 'afterglow' from such 
an event in the electromagnetic s pec trum could help in 
identifying the source galaxy (e.g. [U|). Statistical deter- 
mination of redshift using clustering properties of galax- 
ies constitute another possible approach [12 |. 

In this letter, we present a statistical iterative method 
to isolate the source of the GW signal. This formula- 
tion assumes no prior knowledge of the relation between 
the luminosity distance and redshift (DZ relation). The 
method we propose can iteratively improve upon the er- 
rors on DZ relation. Our method is briefly summarized 
as follows: Beams containing a single galaxy with red- 
shift consistent with the expected distance-redshift rela- 
tion would accurately and reliably portray the DZ rela- 
tionship. We shall call such sources 'gold plated' (GP) 
following [101 ] . The expected number of such sources de- 
pends crucially on directional sensitivity. If more than 
one galaxy falls within the beam we propose to rule out 
non-sources through an iteratively improved DZ relation. 
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FIG. 1: The maximum achievable accuracy on the DZ relation 
as a function of redshift using self-calibration for the BBO 
case. The solid line corresponds to perfect pointing and the 
dashed curve is for the BBO pointing accuracy, where AObbo 
is taken from [lQ]. The other curves correspond to degrading 
the pointing accuracy to IOAObbo, 50Af2BBO, IOOAAbbo, 
corresponding to curves with increasing values of SDl/Dl- 
The last two curves have a region in the middle where self- 
calibration does not work since the required number of sources 
from Eq. (fT2")l is larger than 3 x 10 5 . The BBO accuracy is 
almost as good as the case for perfect pointing, with small 
departures at intermediate redshifts. 



In our proposal we do not use other means of constraining 
the DZ relation such as SN la, since SN la systematics 
is likely to be far more complex than previously thought 
[13{ . Instead, we shall use GP sources to provide an inde- 
pendent approximate starting DZ relation which we iter- 
ate upon. To what precision this can be achieved depends 
upon the details of cosmology as well the experimental 
parameters. In this letter we investigate the efficacy as 
well as limitations of this iterative self-calibration. 

Redshift Error-boxr. The directional accuracy of a GW 
signal is determined by the experimental beam width 
AQ, giving rise to the possibility that several sources 
might lie within the beam. To single out an object as 
the unique source of the GW signal we either require a 
smoking gun signal or a criterion by which other objects 
in the beam can be ruled out as possible sources. In 
the presence of statistical uncertainty in DZ it is impos- 
sible to establish precisely the redshift z for a source of 
gravitational radiation. One must settle instead for an 
uncertainty z ± Az a where 



Az a ~ ^ '- = r) m $(z) , 



(1) 
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FIG. 2: The minimum number of sources iVo required to self- 
calibrate the DZ relation at a given redshift (see Eq. (|12|) 'l is 
shown as a function of the pointing accuracy AQ. The curves 
(right to left) correspond to z — 0.5, 1,2. The value of iVo 
rises steeply with the beam width since an increasing bin size, 
Azbin, necessary to suppress cosmology errors, conflicts with 
the requirement of unbiased calibration. (The latter cannot 
be satisfied if most GW sources are associated with multiple 
optical counterparts in the pointing beam.) 



a single source. The dimensionless standard deviation 
Vm( z ) = o~ ra {z)/Di, is partly due to instrumental noise 
and partly due to weak lensing. The dominant uncer- 
tainty is due to lensing and although lensing produces an 
asymmetric distribution of magnifications, for our pur- 
poses we shall assume that the distribution is described 
by a Gaussian with a dimensionless standard deviation 
Vwi{z) — 0.042z, derived from the results of [1J|. In this 
paper, we also add in quadrature a fixed value of instru- 
mental noise ?7i ns t = 0.02 to the lensing standard devi- 
ation to obtain rj^ = rfa + rjf nst . Az a also contains an 
important cosmology- dependent contribution, so that the 
redshift error box is finally given by (see Appendix) 



A.: 



y/gg 1 (z)+Crg(z) 
Dr. 



$(z) = T} m {z)${z) 



ritiz) 



T&x(z) 



1/2 



$(z) = (d\ogD L /dz) 1 and cr m (z) is the redshift de- 
pendent standard deviation in the luminosity distance to 



(2) 

where cr c (z) is the standard deviation in DZ reflecting 
uncertainty in our knowledge of the expansion history 
and r] c = a c /D L . 

Though multiple objects might lie within the beam, 
it may still be possible, with a small enough value of 
Az a , to single out a source purely on statistical grounds. 
However, since measurement errors on a single source 
are fixed, the only way to lower the redshift error box 
given by Eq. ©, is by reducing the second term in that 
equation. 

Occupation Number. The redshift range Az a together 



with the beam width, Af2, determine the expected num- 
ber of galaxies lying within the beam that are statistically 
consistent both with the approximate DZ relation as well 
as the measurement uncertainty. In order to calculate 
the occupation number n, defined as the mean number 
of galaxies that satisfy this criterion, we have adopted the 
number density of sources from [lOL Il5l Il6| ] . The mean 
number of galaxies in the redshift range 2Az a turns out 
to be 



n{z) 



8N a 

7 

h(z)y/Tr 



j) exp [-r 4 (z)] AQAz a , (3) 



where we have assumed a small Az a so the linear ap- 
proximation suffices. Here r(z) = J Q dz/h{z) is the c/Hq 
normalized coordinate distance, h{z) = H(z)/Hq and 
Nq = 1000 arcmin -2 is the projected number density 
of g alaxies consistent with the Hubble Ultra Deep Field 
Substituting Az a from Eq. @ we obtain 



v{z) 






1/2 



(4) 



where we have defined the minimum occupation number 
v(z) as the value of n when n c — 0, namely 



v{z) = 



h(z)y/ir 



r(z)exp[-r 4 (z)]?7 m Afi. (5) 



We shall assume that galaxies falling within the beam 
are distributed uniformly randomly, however, at the end 
of this letter we briefly discuss how clustering of galax- 
ies affects our analysis. Knowing the occupation num- 
ber n, the probability that there be k galaxies, apart 
from the source galaxy, within the beam is given by 
Pr(fc) = n k exp(— n)/k\. If there is only a single object in 
the redshift error-box then we shall assume that it is the 
source of the signal. The probability for such instances 
is given by Pr(0) = exp(— n). Clearly the limiting frac- 
tion of sources that cannot be resolved statistically is 
1 — exp(— v(z)), which for v(z) < 1 is approximately 
given by v(z). 

Method: A GP source would measure the DZ rela- 
tion with an accuracy r/ m at a redshift z. Let us con- 
sider a redshift bin, Azbin, centered at the redshift z. 
The total number of sources in this bin is given by 
AN(z) — N f(z)Azbm, where No is the total number of 
GW sources at all redshifts and /(z)Azbin is the fraction 
occurring in the bin Azbin- The value of N (NS-NS bi- 
naries) for GW space missions is expected to range from 
N ~ 10 3 (LISA) to A ~ 10 6 (BBO). Let us assume 
that there are ANqp(z) gold plated sources in the bin. 
These sources furnish a first estimate of the DZ relation. 
Since each source has a measurement error given by rj ra 
then clearly the zeroth error on cosmology is given by 

?7c0 = Vm/\/AN G p(z). 

We note that if there happen to be no GP sources 
in the redshift bin Azbm then t] c q can be obtained by 
fitting a dark energy model to the GP sources at other 
redshifts. However, with no a priori reason to assume 



a given behavior for dark energy, we advocate this self- 
consistently obtained DZ relation where each redshift is 
dealt with independently. 

Since we now have the zeroth order information about 
cosmology we can use n C Q to c alculate the occupation 
number no = v(z)y/l + lito/Vm- The zeroth knowledge 
of the DZ relationship resolves some sources to give the 
new value of resolved sources (GP sample and statisti- 
cally resolved sources) as AN^ olvcd — Pr(0)AA(z) = 
AN(z) exp(-rlo), and thus provides us with a first im- 
proved estimate r) c \ = r/ m / y/AN(z) exp(— no). With this 
refinement in rj c we can recalculate the occupation num- 
ber at the first iteration as 



77,1 = v(z) 



1 



AA(z)exp(-77 ) 



1/2 



(6) 



It is clear that iterating further we shall obtain the re- 
currence relation 



n l+1 = v(z) 



1 



1 



AN{z) exp(-n^) 



1/2 



(7) 



The iteration terminates when rij+i = fii, and therefore 
the saturation cosmological uncertainty, which is the sec- 
ond term inside parenthesis in the previous expression, is 
given by 



Vm 



1 



\/N f{z) Azbin exp (-n s 



(8) 



where we substituted for AN(z) to explicitly show the 
dependence of the saturation occupation number on the 
bin size and the subscript s denotes saturation value. 

The uncertainty decreases as Azbin increases. How- 
ever, since the bin size cannot be arbitrarily large, this 
ratio has a lower bound, which we parameterize as 
7 7c nin / 7 7m = niin(77 c /?7 m ) s = e(z), where the minimum 
value is obtained by choosing the largest allowed bin 
size. The occupation number in this case is given by 
n s = v{z)^J\ + e 2 (z), and using Eq. ([5J it follows that 
the bin size required to achieve this accuracy is given by 



Az biK 



exp v(z)y/l + e 2 (z) 



N f(z)e 2 (z) 



(9) 



Averaging sources in the bin Azu n introduces a sys- 
tematic bias rjl ys in the DZ relation. If the bin size is 
small we can assume that the sources are distributed uni- 
formly in the bin. By Taylor expanding the luminosity 
distance Dl, and taking its average over the bin we can 
easily obtain 



<fc 
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which is correct up to third order in Azbj n , and ( ) denotes 
averaging over the bin. If we demand n™ 111 = e(z)r] nl > 
n^ ys then we obtain 



e(z) exp 



:v(z)y/l + e*(z) 



> 



2A^N^f(z)V^r 1 1 J 5 



PI 
Dr. 
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(11) 



where we have substituted Azun from Eq ([9]). This for- 
mula encapsulates our main result and determines the 
limit of self-calibration. 

In FigQ]we plot SDl/Dl = e(z)i] nl for BBO, assuming 
an equality in the above expression i. e. assuming that the 
systematic term is equal to the random error. We have 
taken a flat ACDM model with il m — 0.3 for this figure. 
For this plot we have taken No, f(z) and AflBBo(z) from 
[10J . The same figure shows the accuracy obtainable for a 
degraded beam by applying a constant multiplying factor 
to the BBO beam value. 

Necessary condition for self- calibration: The left hand 
side of Eq. (fTTj) has a redshift dependent upper bound 
that we denote as b{z). Self-calibration would work only 
if the left hand side is larger than the right hand side, 
leading to 



with 



N > 
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1/2 



(12) 



If this condition is not satisfied then the bin size required 
is too large and the systematic term would dominate 
the random error. Therefore, in an experiment the to- 
tal number of sources Nq should satisfy this inequality 
to self-calibrate at a given redshift. In Fig [2] we plot this 
for a few redshifts as a function of Afi. 

Gain factor. We now give a rough estimate of the num- 
ber of sources resolved through this method. At a given 
redshift z a single source measures the luminosity dis- 
tance at a fractional accuracy n m , which we can take as 
the cosmological accuracy rj c , leading to the occupation 
number at the beginning to be no = v{£)yj\ + rfc/rj^ — 
\[2v{z). To obtain the resolved fraction at the end of it- 
eration we shall consider the extreme case when Nq 3> 1 , 
which along with Eq © gives e s ~ 0, therefore the sat- 
uration occupation number is n s ~ 0.414 v(z). Since 
for a given occupation number n, the fraction of total 
resolved sources is given by exp(— n), it is clear that the 
ratio of resolved sources at the end to that at the be- 
ginning (of iteration) is given by exp[0.414i/(z)]. Since 
v(z) is proportional to the pointing accuracy, the gain 
is an exponential function of the beam size. As an ex- 
ample, at z ~ 1.75, for BBO the minimum occupation 
number v = 0.63 [19J, giving a maximum gain factor of 
~ 1.3, while for DECIGO, assuming a beam linear size 
about a factor three worse, the gain factor is about ~ 11, 
showing the extreme sensitivity of gain to the directional 
sensitivity. 

Effect of clustering: In the discussion so far we have ne- 
glected the impact of galaxy clustering. The effect of the 
galaxy clustering can be taken into account by replacing 



n[i + ^itw 



(13) 



Here AV is the volume bounding the redshift and angu- 
lar error-box (Az and Af2) in the determination of the 
source. £ is the two-point correlation function of galaxy 
clustering and the integral extends over AV. 

Here we give estimates of the impact of clustering (sec- 
ond term of Eq. ([13])) for BBO and DECIGO configura- 
tions at z ~ 2. In the approximation, valid for these 
cases, in which the (comoving) linear size corresponding 
to angular resolution lj_ is much smaller than the distance 
corresponding to the radial distance lu of the (minimum 
i.e. when n c — > 0) redshift error, one can readily show 
that 

fi AV J V°- 2M P C / V 300M P C 



(14) 

Ah/h ~ {1,0.45} for the BBO and the DECIGO at 
z ~ 2. Eq. (fi"4"|) shows that this term scales inversely 
with {||, and therefore the effect of clustering would be 
less important in the beginning of the iteration process 
when r\ c could be appreciable but would be increasingly 
important as the maximum achievable precision is ap- 
proached. 

We have shown that by iterating over a self- 
consistently obtained DZ relation from resolved gravity 
wave sources it is possible to improve the luminosity dis- 
tance (DZ) relation and therefore isolate those sources 
that initially are unidentifiable (owing to multiple ob- 
jects in the pointing beam). 

However, due to the fact that in this process only the 
cosmological errors are reduced, the limiting resolved set 
crucially depends on the pointing accuracy at a given 
redshift. We have derived analytical expressions for the 
final accuracy reached on the DZ relationship as well as 
the condition for successful self-calibration (Eq rj2] and 
Figure 2). Our formulation will help future GW probes 
grapple with the issue of redshift measurement uncer- 
tainty due to the presence of multiple objects within their 
beam (Figure 1). A comprehensive analysis using simu- 
lated data to estimate cosmological constraints arising 
from future GW experiments will be presented in a com- 
panion paper. 



Appendix A 

We now derive the probability distribution function 
(PDF) for the source redshift given a cosmological model 



and a distance measurement to a GW source. We first 
derive the general formula and then specialize to the local 
approximation used in this paper. 

Let the measured distance be given by d m . To quantify 
cosmology errors we employ a linear model for the DZ 
relation, 



N 

D L (z,h) = ^2hifi(z) 



h T f, 



(Al) 



where h are the N parameters of the model, /, are N 
arbitrary functions of redshift, and we have defined f = 
[fx(z), fi(z), ■ ■ ■ > In(z)]. The unknown redshift is to be 
treated as a parameter of the model. The simplest choice 
is fi = z i ~ 1 , leading to a polynomial form for Dl(z). The 
parameters h are not known precisely and are described 
by the Gaussian distribution 
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exp 
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. (A2) 

where C is the covariance matrix, obtained by fitting the 
model to the resolved sources (or to other data sets), and 



ho are the best fit parameters. We employ bold lower case 
letters to denote column matrices and bold capital letters 
to denote second rank matrices. Employing the Bayes 
theorem we can write down the posterior probability for 
the parameters of the model as 



P(z,h\dm) OC P(d m \z,h)P(h)P(z) , 



(A3) 



where -P(h) is the prior PDF for the parameters h given 
by Eq. IA21 the prior P(z) is assumed to be flat, and 



P(d m \z,h) 
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The PDF in Eq. IA3I is a function of redshift z and h, 
therefore the posterior PDF for the source redshift can 
be obtained by integrating over h 



P(z\d m ) ex J P(d m \z, h)P(h) d N h , 



(A5) 



which can be expressed through variables g = h — ho and 
X = d m - h f as 
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r 



where the coefficients that do not contain z and g have 
been dropped, and we have defined a second rank matrix 
F = f ® f . If we define a matrix S = C _1 + F/er 2 ^ then 
this equation takes the from 



P{z\d m ) oc / exp 
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Translating the coordinate system in the parameter space 
g = u + u , where u is such that Su = x^/ a m^ we 
finally obtain 



P(z\d m ) oc / exp 
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Carrying out the integration and dropping all terms that 
do not depend on the redshift we obtain 



P(z\d m ) oc det(S) exp 



X 
2al 



fTg-l f 



(A9) 



normalized in the range z = to z — z max , and would, 
in general, produce an asymmetric distribution, due to 
the manner in which the cosmological errors scale with 
redshift. 

Local approximation: Since the PDF peaks at x = 0, 
we can define a redshift z$ through d m = h f (zq). If the 
cosmology is determined precisely then we can assume 
the redshift PDF to decline rapidly away from zq, and 
therefore we can replace h f (z) — h f (zo) + h f'(zo)(z— 
Zq), implying x = — h f {zq)(z — zq). Then, at the same 
level of accuracy we can replace f = f = f (z ) in f T S 1 f . 
To evaluate fj S _1 fo, we need an expression for S _1 . 
Noting that Fo is a rank one matrix, we have [l8( 



So _1 = C- 
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where g — trFoC/cr 2 ^. Noting that c 2 = trFoC, it can 
be readily shown that 



foSo fo 
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(All) 



Recalling that x — d m — h f , we find that the redshift 
probability distribution is centered at the redshift pre- 
dicted for the distance d m by the best fit model d(z; ho). 
Since the functions f are redshift dependent, the precise 
behavior of this function is complicated. P(z\d m ) can be 



The redshift probability distribution function can be now 
written explicitly as 



P(z\dr, 
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where a z = y/(a^ + a 2 c ) / D' L (z ) . 
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